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(57) Abstract 



Content Lifecycle Management (CLM) is a method for keeping content within an intranet (or internet) up-to-date and relevant for 
users of the website by archiving low-value, unused, inactive, and obsolete content from the site based on a combination of ratings threshold, 
usage threshold, and expiration date. Archiving content removes general access to that content, but retains it in the site for searches specific 
to the archive. CLM uses these threshold values as a method for providing automated feedback and notification to the publisher and/or 
author of aging or poorly performing content. 
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ONLINE CONTENT LIFECYCLE MANAGEMENT BASED ON USAGE STATISTICS, USER-SUPPLIED 
VALUE RATINGS AND EXPIRATION DATES 



Field of the Invention 

The invention pertains to a system for maintaining and accessing a diverse 
collection of information to identify and keep information that is current and useful. 

Background of the Invention 

As information databases become more prevalent and more complete, a 
number of content management issues must be confronted. In particular, valuable 
data often become "lost in the haystack" of lower-quality content. In addition, 
systems rarely remove stale content or provide meaningful feedback to authors or 
publishers of the value of their content. It is an object of the present invention to 
address these issues by providing methods for winnowing databases to preserve high- 
quality content while archiving or deleting low-quality or stale content. 

Summary of the Invention 

In one aspect, the invention pertains to a system for maintaining currency and 
value for a collection of information. When a piece of information (a content item) is 
added to the collection, metadata is added to a database specifying the location of the 
information, and threshold values for usage and utility of the information. When the 
database item is accessed to access the information, the user is allowed to specify a 
value rating for the content item, and an access log is updated. The frequency of 
access of the content item and its value can thus be monitored to determine if either 
has fallen below the associated threshold value. If either usage or value rating falls 
below its threshold, the system performs some action, such as warning the author of 
the information that it is not being used or is received low ratings, marking the item 
for archiving, archiving the item, or deleting the item from the system. Content items 
may further have associated expiration dates which are given at the time that the 
content item is added to the collection. If a content item is approaching its expiration 

1 
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date, the system may notify the author or perform another action such as archiving the 
item. The system can be used for monitoring diverse collections of information, 
including such items as URLs and web pages, files stored on a fileserver, whole 
directories and subdirectories, and nondigital data such as books. 

5 

Brief Description of the Drawing 

The invention is described with reference to the several figures of the drawing, 
in which, 

Figure 1 shows a sample screen containing content and a ratings entry area; 

10 and* 

Figure 2 shows the content matrix. 

Detailed Description 

Websites today are faced with daunting content management issues including 

15 the proliferation of old and stale content and poor capabilities of providing feedback 

and notification to the author. Content Lifecycle Management (CLM) is designed to 
eliminate this proliferation by keeping content that is displayed on an intranet site 
relevant and up-to-date. This goal is accomplished by tagging each item of content 
recorded in the content management database with an expiration date and threshold 

20 values. Threshold values are minimally acceptable levels in order for content to 

remain current and available. CLM uses at least 2 thresholds: one for the rating (value 
of the content) and one for the usage ( how often each item is viewed.) 

When content is published to the intranet, it is tagged with content thresholds 
for ratings and usage which the author or publisher feels represent minimum values^ 

25 When content falls below either value based on a given timed period (for example, 

weekly basis), the CLM will notify the author of this fact, for example by e-mail, 
from within the CLM application, or both. The tabulated values which are required 
for this are collected from users who view and rate the content on the site. Figure 1 is 
a screen shot illustrating how feedback is collected. Each time a user views a 

30 document (or item of content) it is registered in the usage table. When a user rates the 

content using the ratings engine 10 at the top of the window, it is also recorded 
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associating the content ID with the user ID, date, and rating. Rating of content may 
be either optional or required for the user. 

The maximum and minimum values of the two thresholds form the CLM 
matrix, shown in Figure 2. Based on the rating and usage values, each content item 
5 can receive one of four classifications: ( 1 ) active or good content 20, which means 

that this item of content is receiving high usage and that users find it of high value; 
(2) needs promotion 22, which means that users value the content but usage is low 
(usually because users can not easily find the content); (3) needs updating 24, which 
means that users access the file frequently but that it is not of significant use or value; 
1 0 and (4) archive content 26, which is the default action for content that is not being 

viewed often and is of low value. 

The ratings and usage of content in combination with its expiration date serve 
as an effective indicator for archiving content. 

The inventive system for maintaining currency of the information dovetails 
1 5 well with systems for targeting information to particular users or groups of users. For 

example, the methods described herein may be used with the methods described in 
U.S. Provisional Application No. 60/129,106, filed April 13, 1999, and U.S. Patent 
Application "Group Targeted Content Personalization," (attorney docket 2001774- 
0001), filed on even date herewith. 

j 20 

Example 

In one embodiment of the invention, Content Lifecycle Management is based 
I on a set of database tables that are part of an overall intranet database. The following 

! 

data are cataloged in the database tables: 
25 • Content information - may include ID, name, file name, content type, 

publish and expiration dates, author/publisher, approval and archive status, and 
| ratings/usage thresholds for each content item 

i • Ratings - may include content ID, user ID, date, rating, and module in 

which the content was rated 
30 • Usage - may include content ID, user ID, date, and module in which the 

content was viewed 
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• Author/Publisher Info - may include e-mail address for notification by e- 

mail 

The following data are stored on durable media such as a hard disk: 
5 • Content, which may include documents, files, data, executables, and 

catalogue entries or other references to nondigital content. 

• Archiving Daemon (code) 

• Low-Usage Daemon (code) 

• Low-Rating Daemon (code) 
10 • • CLM Matrix Daemon (code) 

Content Publishing 

Content is published to the intranet using the publishing functionality within 
the system. During the publishing process, the file may be uploaded and stored on the 

15 server and all file-specific and publisher-entered information as well as the 

association between the file and the database entry is saved in the database tables. 
Data specific content such as web links (URLs) are stored in the database and the 
information is associated with the database entry. The user-entered information may 
include items such as the classification of the document using the topic and subtopic 

20 parameters, access classification (who has access to view the document), target 

classification (who would most benefit from this content), expiration date and the 
rating and usage thresholds for the content. (Access and target classification are 
discussed more fully in U.S. Provisional Application No. 60/129,106, filed April 13, 
1999, and U.S. Patent Application "Group Targeted Content Personalization," 

25 (attorney docket 200 1 774-000 1 ), filed on even date herewith). 



Archiving Daemon 

The Archiving Daemon comprises executable code which is run as a 
scheduled task on a periodic basis (set by the administrator). In one preferred 
30 embodiment, it is set to run on a nightly basis. Each time the archiving daemon is 

run, it makes two comparisons. First, it compares the expiration date of each record in 
the content library with the current date. If the current date matches the expiration 

4 
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date, the record is archived and an e-mail is sent to the author notifying the author that 
the record has been archived. At this point, the content that has been archived is no 
longer available throughout the site unless a search on archived content is executed. 
If the current date represents a period of a week before the expiration date (or 
5 any other suitable time period), the archiving daemon sends an alert e-mail to notify 

the author that the content is aging and will be retired at the end of the week unless it 
is updated. 

Low-Usage & Low-Rating Daemons 

10 * These daemons are used to notify the author if content the author has 

published falls below the rating and usage thresholds set when the content was 
published. Like the Archiving Daemon, they may be set to run on a periodic basis 
(e.g., nightly). These Daemons may further directly archive content which falls 
below one of the thresholds (e.g., if the author/publisher does not respond to an 

15 invitation to improve the content). 

CFM Matrix Daemon 

The Content Lifecycle Management Daemon compares the averaged ratings 
and the usage values of each record in the content management database against the 
20 administration-set values for both the rating and usage. The administration set values 

are the index values and allow the matrix to be configured to a particular group of 
users. If the rating and usage values fall above the index values for both ratings and 
usage, no notification is sent. If either the usage or the ratings values fall below their 
respective index value, a notification is sent. 

25 

Other embodiments of the invention will be apparent to those skilled in the art 
from a consideration of the specification or practice of the invention disclosed herein. 
It is intended that the specification and examples be considered as exemplary only, 
with the true scope and spirit of the invention being indicated by the following claims. 
30 What is claimed is: 
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1 1 . A computer-implemented method of maintaining currency and value of a 

2 collection of information, comprising: 

3 adding an item corresponding to a single piece of information to a database of 

4 metadata pertaining to the collection of information, where the added item includes 

5 an identifier indicating a location for the piece of information; 

6 a minimum value level for the piece of information; and 

7 a minimum usage level for the piece of information; 

8 accessing the added item in the database of metadata, where accessing 

9 includes 

1 0 * using the identifier to access the piece of information; 

1 1 updating a record of the actual usage level for the piece of information; 

12 and 

13 allowing updating of a record of the actual value rating for the piece of 

14 information; and 

15 performing an action in response to a condition in which 

16 the actual value rating is below the minimum value level; or 

17 the actual usage level is below the minimum usage level. 
18 

19 2. The method of claim 1 , wherein the item further includes an expiration date 

20 for the piece of information, and wherein an action is performed in response to a 

21 condition in which the actual date is within a selected time period from or is equal to 

22 the expiration date. 
23 

24 3. The method of claim 1 or 2, wherein the action is selected from the group 

25 consisting of: 

26 removing the item from the database of metadata; 

27 marking the item for archiving; 

28 placing the item in an archive database; and 

29 notifying a user that the condition exists. 

30 4. The method of claim 1, wherein the identifier is selected from the group 

3 1 consisting of uniform resource locators, file location paths, directories, subdirectories, 

32 and catalogue entries for nondigital information. 

6 
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